home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
The World of Computer Software
/
The World of Computer Software.iso
/
ts1st20.zip
/
STATMEAS.INF
< prev
next >
Wrap
Text File
|
1992-09-22
|
15KB
|
378 lines
- 1 -
Tue 22-September-1992 (All rights reserved)
About TS1ST in General (Statistical univariate measures)
======================
Apply question mark ? with the program call for a brief description
of a program.
This package may be used and distributed freely for NON-COMMERCIAL,
NON-INSTITUTIONAL, PRIVATE purposes, provided it is not changed in
any way.
┌────────────────────────────────────────────────────────────────┐
│ For ANY other usage (such as use in a business enterprise or a │
│ university) or the full scale version contact the author for a │
│ personal or a site license. │
└────────────────────────────────────────────────────────────────┘
Please do not distribute any part of this package separately.
Uploading to BBSes is encouraged.
The registered version is strictly for the registrant only.
Identical programs must NOT be running on more than one computer at
a time. Site licensed programs must not be run outside the licensed
site.
The programs are under development. Comments and contacts are
solicited. If you have any questions, please do not hesitate to use
electronic mail for communication.
InterNet address: ts@uwasa.fi (preferred)
Bitnet address: SALMI@FINFUN
The author shall not be liable to the user for any direct, indirect
or consequential loss arising from the use of, or inability to use,
any program or file howsoever caused. No warranty is given that the
programs will work under all circumstances.
Timo Salmi
Professor of Accounting and Business Finance
Faculty of Accounting and Industrial Management
University of Vaasa
P.O. BOX 297, SF-65101 Vaasa, Finland
CONTENTS:
1. Acknowledgments
2. General Description
3. Formulae
4. The Other Statistical Programs in the Set
5. The Statistics Set for Other Computers
6. Release Notes for Statmeas
7. List of the files in the package
- 2 -
1. ACKNOWLEDGEMENTS
Useful help by the following colleagues in testing STATMEAS and
discussing the statistical concepts is gratefully acknowledged:
Associate Professor Martti Luoma, Lecturer Antti Kanto, and
Assistant Markku Malkamäki.
2. GENERAL DESCRIPTION
STATMEAS (Ver 2.0)
STATistical MEASures program is part of the interactive
statistical system by Timo Salmi. It is the first program in the
set, and is intended for univariate statistical analysis. (For the
other programs in the set see Chapter 4.)
STAMEAS includes a built-in help system, which can be invoked by
typing ? at any interactive question. Because of this built-in help,
and the interactive nature of the program's user interface, no
long-winding instructions have been included. (Who reads
instructions anyhow?)
The program calculates for a set of observations in accordance
with the user's choice:
1) Standard statistical measures
2) Autocorrelations
3) Runs
and
4) Draws frequency diagrams.
The data can either be given from the keyboard or taken from a
file. If the input is to be taken from a file it must first be
prepared with some editor, or some word processor which includes an
option for preparing ordinary ascii text.
The data is given to the program in the following format:
X1 X2 X3 !variable names (! denotes a comment)
3.56 6.32 -1.73
5.12 -4.21 9.18
14.2 5.11 0.31
END !END is optional in a file
A missing item in an observation is marked by a hash (#). E.g. if
the first item of the second observation were missing, the
observation should be written as # -4.21 9.18
The items in an observation can be separated with blanks, as in
the above, or with commas (,) e.g. 5.12,-4.21,9.18. The number of
the intervening blanks is irrelevant, and can be customized for
increased readability.
Thus e.g. 5.12 -4.21 9.18
and 5.12 -4.21 9.18
are equivalent.
A row can be continued using an ampersand (&). E.g. the variables
could be given as
X1 X2 &
X3
Alternatively, * can be used instead of & as the continuation
marker.
Comments can be added to the input data. If ! appears on a line
all text after ! will be considered as a comment.
A header can be entered on each page if output is directed to a
file. To accomplish this start the very first line on the input file
with a double exclamation mark (!!) and the rest of the line will be
used as the header. Thus !! indicates a header, a single ! an
ordinary comment.
The maximum number of variables is 25. The maximum number of
observations is 400 (for each variable). The public domain version,
however, sets the limits at 4 and 100 respectively.
3. FORMULAE
This chapter gives a selected summary of the univariate
statistics formulas used in STATMEAS.
Denote
N = number of observations (of the relevant variable)
x(j) = value of the j'th observation (for the relevant variable)
Mean kx = Σ x(j) / N
____________________
Standard deviation s = √ Σ (x(j)-kx)²/ (N-1)
Variance s² = Σ (x(j)-kx)²/ (N-1)
Coefficient of variation = standard deviation / mean
3 3 ½
Skewness g(1) = Σ (x(j)-kx) / (Ns ) standard error (6/N)
4 ½
Kurtosis g(2) = Σ (x(j)-kx) / (Ns ) - 3 standard error (24/N)
N-lag
Σ (x(j+lag)-kx)(x(j)-kx)
j=1
Autocorrelation r(lag) = ----------------------------
N
Σ (x(j)-kx)²
j=1
_______________________
│ lag-1
Standard error std(r(lag)) = │ (1 + Σ 2 r(i)²) / N
√ i=1
t-value = r(lag) / std(r(lag))
lag
Ljung-Box Q = N(N+2) Σ r(i)² / (N - i)
i=1
Run statistics:
Denote n1 = number of negative changes
n2 = number of no changes
n3 = number of positive changes
n4 = n1 + n2 + n3
expected number of runs = (n4(n4+1) - n1² - n2² - n3²) / N
number of runs - expected number of runs + 0.5
Standardized k-value = ----------------------------------------------
std deviation of runs
4. THE OTHER STATISTICAL PROGRAMS IN THE SET
STATMEAS is the first statistical program in a set of programs
intended for analyzing statistical data. The second program in the
set is STATREGR (STATistics: multiple REGRession), which also has
been released. Plans include STATTRAN (STATistics TRANsformations)
for effecting transformations (such as logarithms, differences,
roots, etc.) on the data, and STATRCOR (STATistic Ranks and
CORrelations) which is intended for parametric and non-parametric
correlation analysis.
STATTRAN, STATRCOR, and STATLADR have now been released to
complete the set. Each of the programs appears in a different .zip
file. The names are:
TS1STxx.ZIP <-- STATMEAS
TS2STxx.ZIP <-- STATREGR
TS3STxx.ZIP <-- STATTRAN
TS4STxx.ZIP <-- STATRCOR
TS5STxx.ZIP <-- STATLADR
5. THE STATISTICS SET FOR OTHER COMPUTERS
The Statistical programs by Timo Salmi are (were) also available
for ill-fated the Sinclair QL computer. Named STATPREP the system is
part of a Public Domain library for QUANTA members. The descriptions
of the files in the Quanta Library are given below as taken directly
from a QL database.
The central part of the package has also been programmed for the
VAX/VMS by the author, but he input and output commands of the VAX
"REGRE" and "LADRE" programs are in Finnish, and the VAX versions
are not public domain. (LADRE is a Least Absolute Deviation multiple
REgression analysis.)
In Quanta Library:
STATPREP_DOC
Quill documentation of Timo Salmi's statistical package.
Associated programs STATPREP_EXE, STATTRAN_EXE, STATMEAS_EXE,
STATRCOR_EXE, STATREGR_EXE, STATDIAG_EXE, and STATMENU.
STATMENU
Besides EXEC_W'ing the statistical package programs you can
LRUN STATMENU and let it take care of the calls. Makes the
package still easier to use. Alter the default device(s)
(FDK1_) for your own configuration, if necessary.
STATPREP_EXE (statistics preparation editor)
An editor of a kind used for entering the data for the
associated statistical programs. Maximum 10 variables with 200
observations (each).
STATTRAN_EXE (transformations on variables)
Used for making transformations on observations prepared with
STATPREP_EXE. Transformations include taking logarithms,
raising to power, taking roots, scaling, and multiplying
between observations. Takes its input from a <statprep> file
prepared by STATPREP_EXE and creates a new <statprep> file with
the original and the transformed variables.
STATMEAS_EXE (statistical measures)
Standard statistical measures and frequency diagram for any of
the variables in the data. Measures include minimum, maximum,
mean, median, standard deviation, variance and kurtosis.
STATRCOR_EXE (ranks and correlations)
Calculates both the usual (Pearson's) correlation coefficients
and Spearman's rank correlation coefficients. Also gives, on
request, ranks for the observations of any of the variables.
Note that computing Spearman's rank correlations takes quite
awhile.
STATAURU_EXE (autocorrelations and runs)
Calculates autocorrelations up to the desired lag. Also
calculates runs statistics.
STATREGR_EXE (multiple regression)
Full-scale multivariate regression on the data prepared with
STATPREP_EXE (and STATTRAN_EXE).
STATDIAG_EXE (scatter diagram)
Draws scatter diagrams of the data prepared with STATPREP_EXE
(and STATTRAN_EXE). The user has the choice of the X-axis and
the Y-axis. If the user specifies a regression model (estimated
by STATREGR_EXE), the predicted (i.e. regressed) values can be
plotted on the scatter diagram. Furthermore, a scatter diagram
of the residual can be drawn against any of the variables if
the user has specified the regression model.
6. RELEASE NOTES FOR STATMEAS
Versions 1.1 and 1.2 of STATMEAS include some minor changes and
corrections in the user interface to make it fully compatible with
the other programs in the set.
Version 1.3: If the input file is not found, you have the choice
of listing directories from within STATREGR. The directory routine
has been rewritten for a more relaxed syntax. (For details see the
information on DIRW in TSUTIL.INF in TSUTILxx.ZIP version 1.8 or
later.)
Version 1.4: An attempt to rewrite a protected file no longer
crashes the program.
Version 1.5: This version introduces input recall and line
editing. The special keys Del, CrLf, CrRg, CrUp, Home, End, and Esc
are functional for this purpose. PgUp is the recall key. Line
editing uses insert mode.
Disk access has been made faster (the program has its own cache).
The directory routine has been updated.
Read-only files can be read properly.
Version 1.6: While fixing a bug in the multiple regression
program I also made some enhancements in the routines which are
common to all my statistical programs. Most importantly the insert
key has been made functional in the line editing.
Version 1.7: The line editing potential of the program has been
improved. When the task is given from the keyboard, and continuation
lines are used, the repeated input recall (CursorUp) gets each line
in turn. Ctlr-C and break-key abort have been enhanced.
A line can be continued using an &, or a * at the end. Now also the
backslash \ is accepted. This was suggested by Tuomas Eerola.
Some help texts within the program have been extended.
Version 1.8: Several improvements to the nuts and bolts of the
user interface.
The new usage of the call is
PROGNAME [/h(elp)] [/iInputFileName] [/oOutputFileName] [/cColumnsPerRow]
(the /c option, which regulates the width of the output, is for
registered versions, only). If you use the /i switch, it stuffs the
InputFileName into the appropriate recall buffer. This means that
when the program asks you for the input file name, you can invoke
the input file name just by pressing the CursorUp key. (The same
goes for the /o switch, respectively.) This is very convenient, if
you use the program many times successively making small changes in
your data in between. (This assumes, of course, that you have a
command line editor like DOSEDIT or CED to recall previous MsDos
commands. These common shareware programs can be obtained from any
well-stocked BBS or FTP site.)
The printer readiness test has been rewritten to be more general.
The earlier test failed for some printers, because the codes the
printers send when they are offline are not standardized.
The "file exists, overwrite?" question is no more asked when the
output file is prn, in other words when the output is directed to
the printer.
The user has now a choice of a left margin from 0 to 20 blanks
when output is directed to the printer.
The user has now a choice between formfeed and four blank lines
to start each new page of output.
When an input file is not found, the user is given the choice of
listing a directory. The directory routine has been rewritten.
The file ready message now also includes the file side besides
the name.
Version 1.9: The calculation of the coefficient of variation was
added.
Version 2.0: The input and output file names can be optionally
given as parameters in the program calls, e.g.
STATMEAS /ic:\stat\test.dat /or:\tmp
This option has been improved. The "prefilled" name (e.g.
c:\stat\test.dat) will now automatically appear on the input line
without the need of pressing the cursor up key. All you need to do
is to press enter.
Also made some minor internal changes not worth recording.
Rewrote the document files using a 68 column wrap instead of the
former 80 to make the text easier to read and handle. Added the list
of files in the package to the documentation.
7. LIST OF THE FILES IN THE PACKAGE
TS1ST Statistics by T.Salmi, Part I
Filename Comment
-------- --------------------------------
FILE_ID.DIZ Brief characterization of TS1ST
STATMEAS.EXE Statistical univariate measures
STATMEAS.INF Document
TSPROG.INF List of PD programs from T.Salmi
VAASA.INF Info: Finland, Vaasa, U of Vaasa
---- ------ ------ -----
0005